A parallel tag affinity computation for social tagging systems using MapReduce

نویسندگان

  • Hyunwoo Kim
  • Taewhi Lee
  • Hyoung-Joo Kim
چکیده

Tag affinity is the relationship between tags. It is a useful information for search and recommendation in social tagging systems. Tag affinity is measured by several types of tag cooccurrence frequency. The computation of tag affinity is a time-consuming task as the tagging information is accumulated. To alleviate this problem, we propose a parallel tag affinity computation method using MapReduce. We present MapReduce algorithms for computing three types of tag affinity measures: macro, micro, and bigram tag cooccurrence frequency. Our experimental results show that the proposed MapReduce-based approach not only significantly outperforms existing methods based on a relational database but also provides high scalability. To the best of our knowledge, this approach is the first tag affinity computation on MapReduce.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Parallelization of Maximum Entropy POS Tagging for Bahasa Indonesia with MapReduce

In this paper, MapReduce programming model is used to parallelize training and tagging proceess in maximum entropy part of speech tagging for Bahasa Indonesia. In training process, MapReduce model is implemented dictionary, tagtoken, and feature creation. In tagging process, MapReduce is implemented to tag lines of document in parallel. The training experiments showed that total training time u...

متن کامل

A Computational Model for Mapreduce Job Flow

Massive quantities of data are today processed using parallel computing frameworks that parallelize computations on large distributed clusters consisting of many machines. Such frameworks are adopted in big data analytic tasks as recommender systems, social network analysis, legal investigation that involve iterative computations over large datasets. One of the most used framework is MapReduce,...

متن کامل

A Personalized Tag-Based Recommendation in Social Web Systems

Tagging activity has been recently identified as a potential source of knowledge about personal interests, preferences, goals, and other attributes known from user models. Tags themselves can be therefore used for finding personalized recommendations of items. In this paper, we present a tag-based recommender system which suggests similar Web pages based on the similarity of their tags from a W...

متن کامل

High-throughput Gene Tagging in Trypanosoma brucei

Improvements in mass spectrometry, sequencing and bioinformatics have generated large datasets of potentially interesting genes. Tagging these proteins can give insights into their function by determining their localization within the cell and enabling interaction partner identification. We recently published a fast and scalable method to generate Trypanosoma brucei cell lines that express a ta...

متن کامل

Why do Users Tag? Detecting Users' Motivation for Tagging in Social Tagging Systems

While recent progress has been achieved in understanding the structure and dynamics of social tagging systems, we know little about the underlying user motivations for tagging, and how they influence resulting folksonomies and tags. This paper addresses three issues related to this question: 1.) What motivates users to tag resources, and in what ways is user motivation amenable to quantitative ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • IJBDI

دوره 1  شماره 

صفحات  -

تاریخ انتشار 2014